Microsoft Research at DUC2006: Task-Focused Summarization with Sentence Simplification and Lexical Expansion

Authors

  • Lucy Vanderwende
  • Hisami Suzuki
  • Chris Brockett
Abstract

Our DUC2006 system comprised three main components: a task-focused extractive summarization system, sentence simplification, and lexical expansion of topic words. This paper details each of these components, together with experiments designed to quantify their individual contributions. We include an analysis of our results according to two independent human evaluation methods, the NIST evaluation and the Pyramid evaluation. Our system ranked first in terms of both overall mean score and averaged per-cluster mean ranking out of 22 systems in the Pyramid evaluation, and ranked third out of 35 systems in NIST content responsiveness.

Similar resources

Beyond SumBasic: Task-focused summarization with sentence simplification and lexical expansion

In recent years, there has been increased interest in topic-focused multi-document summarization. In this task, automatic summaries are produced in response to a specific information request, or topic, stated by the user. The system we have designed to accomplish this task comprises four main components: a generic extractive summarization system, a topic-focusing component, sentence simplificat...

Fudan University at DUC 2006

This paper describes our work on the query-based multi-document summarization task in DUC2006. We present a system overview, focusing on the newly developed techniques, including a new method of calculating sentence similarity and the application of anaphora resolution to improve the readability of the summary. Evaluation results from NIST are also given and analyzed.

JU_NLP at SemEval-2016 Task 11: Identifying Complex Words in a Sentence

The complex word identification task refers to the process of identifying difficult words in a sentence from the perspective of readers belonging to a specific target audience. This task has immense importance in the field of lexical simplification. Lexical simplification helps in improving the readability of texts consisting of challenging words. As a participant of the SemEval-2016: Task 11 s...

ISCAS at DUC 2006

This paper describes the architecture of the summarization system IS_SUM from the Institute of Software, Chinese Academy of Sciences, for DUC2006. Improvements to the lexical chain algorithm are described in detail; they are intended to enhance its efficiency and adapt it to query-based summarization. We conclude the paper with the different evaluation results and a preliminary analysis.

On the Effectiveness of using Sentence Compression Models for Query-Focused Multi-Document Summarization

This paper applies sentence compression models for the task of query-focused multi-document summarization in order to investigate if sentence compression improves the overall summarization performance. Both compression and summarization are considered as global optimization problems and solved using integer linear programming (ILP). Three different models are built depending on the order in whi...

Publication date: 2006